Compression of parallel texts
نویسندگان
چکیده
منابع مشابه
Compression of Parallel Texts
The world-wide use of digital storage and communications devices is increasing the need to make texts available in multiple languages. To minimise the cost of storing and transmitting multiple translations of a text, one could store the text in just one language, from which other translations can be created. Unfortunately, the quality of machine translation techniques is not good enough for thi...
متن کاملStatic Compression for Dynamic Texts
Two problems arise when semi-static word-based compression methods are applied to large texts, such as those stored in information retrieval systems. First, the space required for the model during decoding can become very large. Second, the need to handle document insertions means that the collection must be periodically recompressed if compression eeciency is to be maintained. Here we show tha...
متن کاملSructural Matching of Parallel Texts
This paper describes a method for finding struc-rural matching between parallel sentences of two languages, (such as Japanese and English). Parallel sentences are analyzed based on unification grammars, and structural matching is performed by making use of a similarity measure of word pairs in the two languages. Syntactic ambiguities are resolved simultaneously in the matching process. The resu...
متن کاملEfficient Compression of Graphics and Texts
In the paper is presented a new method for efficient lossless documents image compression, based on the Inverse Difference Pyramid (IDP) decomposition. The method is aimed at the processing of graphic and text grayscale and color images. The IDP decomposition is presented in brief and the method ability to process efficiently different kinds of images are presented. In the paper are included th...
متن کاملAligning Clattses in Parallel Texts
This paper describes a method for the automatic alignment of parallel texts at clause level. The method features statistical techniques coupled with shallow linguistic processing. It presupposes a parallel bilingual corpus and identifies alignments between the clauses of the source and target language sides of the corpus. Parallel texts are first statistically aligned at sentence level and then...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Information Processing & Management
سال: 1992
ISSN: 0306-4573
DOI: 10.1016/0306-4573(92)90068-b